Term Recognition and Classi cation in Biological Science
نویسندگان
چکیده
In this paper we describe the application of automatic terminology recognition and classi-cation techniques for two bioinformatics projects: extraction of information about enzymes and metabolic pathways and extraction of information about protein structure, in both cases from scientiic journal papers. The techniques we use were adapted from already available Information Extraction (IE) technology that was developed for name recognition tasks deened in the US Message Understanding Conferences (MUC). Preliminary evaluation results of the terminological identiication components of our systems are quite encouraging and compare favourably with the state-of-the-art results in the MUC competitions. Further, this technology is quite generic and may be readily adapted to other subdomains in biological and medical science. Indeed, we are optimistic that the same methods will be applicable to term identiica-tion tasks in any technical domain.
منابع مشابه
Classifying Technical Terms
Automating the process of term recognition and classi cation is important for digital libraries. Automatic Term Recognition (ATR) has many applications in areas related to digital libraries, e.g. information retrieval and extraction from the web, summarisation, machine translation, dictionary construction etc. Automatic term classi cation attracts the interest of researchers following the steps...
متن کاملA fuzzy hyperspectral classi®er for automatic target recognition (ATR) systems
In this paper we present a fuzzy system based hyperspectral classi®er for automatic target identi®cation. The system is based on partitioning the spectral band space into clusters using a modi®ed fuzzy C-Means clustering algorithm. Classi®cation of each pixel is then carried out by calculating its fuzzy membership in each cluster. The results showed that the fuzzy hyperspectral classi®er is suc...
متن کاملA fuzzy hyperspectral classifier for automatic target recognition (ATR) systems
In this paper we present a fuzzy system based hyperspectral classi®er for automatic target identi®cation. The system is based on partitioning the spectral band space into clusters using a modi®ed fuzzy C-Means clustering algorithm. Classi®cation of each pixel is then carried out by calculating its fuzzy membership in each cluster. The results showed that the fuzzy hyperspectral classi®er is suc...
متن کاملMultiple Attribute Learning with Canonical Correlation Analysis and Em Algorithm
This paper presents a new framework of learning pattern recognition, called \multiple attribute learning". In usual setting of pattern recognition, target patterns have several attribute such as color, size, shape, and we can classify the patterns in several ways by their color, by their size, or by their shape. in normal pattern recognition problem, an attribute or a mode of classi cation is c...
متن کاملSymbolic Nearest Mean Classiiers
Piew Datta and Dennis Kibler Department of Information and Computer Science University of California Irvine, CA 92717 fpdatta, [email protected] Abstract The minimum-distance classi er summarizes each class with a prototype and then uses a nearest neighbor approach for classi cation. Three drawbacks of the minimum-distance classi er are its inability to work with symbolic attributes, weigh at...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000